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Abstract. The problem of estimating the spectrum of a density matrix is 
considered. Other problems, such as bipartite pure state entanglement, can be 
reduced to spectrum estimation. A local operations and classical communication 
(LOCC) measurement strategy is shown which is asymptotically optimal. This 
means that, for a very large number of copies, it becomes unnecessary to perform 
collective measurements which should be more difficult to implement in practice. 



PACS numbers: 03.65.Wj, 03.67.-a, 03.67.Mn 

1. Introduction 

Estimating a mixed state density matrix optimally, when one has N copies of 
it available, is a difficult problem. The problem has been solved for qubits by 
|Vidal et al., 1999| , |Bagan et al., 2004| and by |Hayashi and Matsumoto, 2004| and 
it is known that optimal collective measurements perform strictly better than 
any measurement which can be implemented with local operations and classical 
communication (LOCC). For mixed qudits, i.e., mixed states on a Hilbert space of 
dimension d 7 not much work on finding optimal collective measurements has been done. 
In the present work a simpler case is studied, the estimation of the spectrum of a qudit 
density matrix. This problem has already been studied from the large deviation point 
of view by |Keyl and Werner, 2001| and for the qubit case by |Bagan et al., 2 005 . 

In addition to being interesting in itself, spectrum estimation is useful because 
other problems can be reduced to it: 

• Estimation of bipartite pure state entanglement. This problem has been studied 
for d = 2 by |Sancho and Huelga, 200"0| and by |Acm et al.,2000| . 

• Estimation of generalized Pauli channel. This problem has been studied by 
Fuji wara and Imai, 2003| and the depolarizing channel (special case of Pauli 
channel) by ^SalialtiTtHu^~2'002| . 

In the present paper, an LOCC asymptotically optimal! strategy will be 
described. The optimality of this LOCC strategy will be established by showing 
that it asymptotically satisfies the quantum Cramer- Rao bound (QCRB), stated by 

| i.e. it performs asymptotically as well as any other measurement strategy 
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|Helstrom, T976| . The QCRB is a bound on the mean square error of "reasonable" 
estimators. 

This paper is organized as follows. In section [21 the necessary concepts are 
introduced and it is specified what is meant by optimality. In section [3] the estimation 
strategy is described and the main result is stated more precisely (equation (@J). In 
section0]the conditional mean square error matrix (MSE) is calculated, this is needed 
for the next two sections. A heuristic argument supporting the main result is given in 
section and a proof will be given in sectional (theorem OJ. 



2. Preliminaries 

The density matrix p (p > 0,trp = 1) will be parametrized in the following way: 

d-l d-l 

P (p) = '£p k \k)(k\ + (i-'£p l )\d)(d\, 
fc=i i=i 

where p£0C is the parameter of interest, 

e = |(pi,..., 



d-l ~j 

i) : < Pk < l,J2Pk < 1 \ 

k=l ) 



is the set of possible values of the parameter, and {|1), . . . , \d)} is a basis of 
eigenvectors. 

The quantum estimation problem that will be studied in this paper is that, 
given N copies of a completely unknown p, one is only interested in estimating its 
eigenvalues. Some of the needed concepts and results will be introduced next for the 
N = 1 case. 

Let M be a measurement with outcomes in a finite set Q, i.e., a collection of 
matrices {Mj : £ £ ft} satisfying > and J2^en = an d let p = (pi,... ,Pd-i) 
be an estimator of p, i.e., a map from £1 to 0. The performance of such a measurement- 
estimator pair will be quantified by the MSE 

MSE(p,p,M) fci - E[(p k -p k )(pi -Pi)] =^tx\p(p)M ( .](p ik -p k )(S> i i -pi), 

where E/ means expectation of /. 

The QCRB states that any unbiased§ measurement-estimator pair (p, M) of p 
satisfies 

MSE(p,p,M) > Hip)- 1 , 

where H is the quantum Fisher information (QFI) (see for example |Helstrom, 1976| 
or |Holevo, 1982| ). The QFI can be defined as the matrix with elements 

H(p) kl =Reti\p(p)\ k (p)\ l (p)}, 

where {Ai(p), . . . , Xd-i{p)} are the symmetric logarithmic derivatives (SLD). The SLD 
are defined as selfadjoint solutions to the equation 

d kP (p) = P(P)^(p)+A fc ( P )p( P ) > (1) 
§ Unbiased means that 

Sen 
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where d k means partial derivative with respect to pk ■ 

The SLD for the model studied in this paper are easy to calculate, indeed, writing 
on the basis of eigenvectors we get 

{im {k\-\d){d\]\j)=^^{i\X k (p)\j), 

or 

\k)(k\ \d)(d\ 

Afe(p) = ■ 

Pk Pd 

From the SLD one can then calculate the QFI to get: 

H( P ) kl = — + — , Me{i.»->d-i} 

where p<j = 1 — Yli=i Pl> the inverse of JT is 

H{p)li = PkSki - PkPu k, I G {1, . • . , d- 1}. (2) 

When one has iV copies of p, i.e., the model is of the form p(p)® N the QCRB 
becomes 

MSE(p,p,M)W>^^, 

and this bound is valid for any measurement M (i.e. LOCC or not), as long as the 
measurement-estimator pair (p, M) is unbiased. 

The class of unbiased estimators, however, is too restrictive since in most practical 
situations one deals with biased ones. [Cifi^nd Levit, 1995| used a multivariate 
extension of an inequality due to |van Trees^H)68r"to prove a more gene ral bound. 
From their result and an inequality due to |Braunstein and Caves, 1994| , it can be 

shown that, under some regularity conditions, if V~N{p — p) —* Z(p) then 

V & vZ(p)>H{p)-\ (3) 

where means convergence in distribution. This means that the variance of the 
limiting distribution of any regular estimator satisfies the QCRB. 



3. Estimation strategy 

Suppose now that one knows the basis of eigenvectors, and let us consider the 
measurement with elements Mk — \k){k\. For this measurement the probability of 
outcome k is 

tx[p{jp)M k ] =p k . 

Now suppose this measurement is performed on N copies of p, let N k be the number 
of times that outcome k was observed, then {N%, . . . , A^-i} have a multinomial 
distribution, i.e., 

ATI A 

Pr(7V! = m,..., Ar d _ x = na-i) = , ■ ]J P n k \ 

Y\k=l n k ] - k =l 

where = N — X)fc=i n k- The estimator 



Pk = lv 
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is unbiased and a simple calculation shows that its MSE equals the inverse of the QFI 
divided by N which means that it saturates the QCRB and therefore it is optimal. 

This would be the whole story, except for the fact that we have assumed that the 
eigenbasis of p is known. If the eigenbasis is not known one can try to use a two-step 
adaptive strategy such as the one considered by |Gill and Massar, 2000| . The idea 
is to make an initial rough estimate of p on an asymptotically vanishing fraction of 
the copies, e.g., with < p < 1. Let a be that initial estimate of p and \i[>k) be 
its (not necessarily unique) eigenbasis. On the rest of the copies (N — iV*) of p, the 
measurement with elements Mk = IV'fcXV'fcl is performed. 

In the rest of this paper, it will be shown that this method is asymptotically 
optimal, i.e., it asymptotically achieves the QCRB: 

lim NMSE(p,p,M) (N) = Hip)' 1 , (4) 
provided p is chosen strictly larger than 1/2. 



4. The MSE in the adaptive scheme 

Let Ni = and Nf — N — be the sample sizes for the first and second stages 
respectively. In the second stage, the probability of outcome fc, given the initial 
estimate a, is 

q k = tvMkpip) = (ipk\p(p)\ipk)- 

These probabilities are also a random variable. 

Next the MSE of the second stage (i.e. assuming fixed q's) will be calculated. A 
condition for obtaining Q will be derived from it. 

Just as before, let AT& be the number of times that outcome k is observed and let 
us estimate pt as 

- N k 

Pk = W 

The expectation of this estimator conditioned on cris 

E[pfc|a] = qk, 

so that in general it is a biased estimator. A simple calculation shows that the MSE 
conditioned on the first rough estimate of p is 

E[(Pfc -Pk)(pt -Pi)\<r] = qk6u AT + (Pk - qk)(pi - ?;)> ( 5 ) 

the second term is the square of the bias, the MSE itself is 

MSE(p,p,M)W =E[E[(p k -pk)(pi-pi)\o-}]. 

Comparing and J^J and using the fact that N/Nf — > 1 as N — > oo, it is easy 
to see that in order to get JIJ it is sufficient that 

lim E[N(q k -pk)(qi-pi)] = 0. (6) 

N— >oo 

Indeed, if this is true, then it also holds that — ► pk and E[#fc<#] — > PkPi- 
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5. Heuristic argument 

Suppose for simplicity, that all eigenvalues of p are different, then one expects that 
after the first estimate, the eigenbasis of p and the eigenbasis of a are related by a 
unitary matrix which is very close to the identity, i.e., 

iv^> = u\k), 

with 



where {T\, . . . , T' d 2_ 1 } is a basis of su(d) satisfying tr T a Tp — S a p, r\ E R d 1 and \\ij\\ 
is small. One can then expand U in Taylor series about rj = 0, 

U = l + i V -T-±(n-T) 2 + o(\\ V \\ 2 ). 

For any decent initial estimation strategy, rj is expected to go to as N — > 00 at a 
rate of N~ 1/2 = N~^ 2 . 

The expression for q k is 

q k = Y,PlW\U\k)\\ 
I 

and 

\(l\U\k)? = hi + (1\V ■ T\k)(k\n ■ T\l) - S kl (k\( V ■ Tf\k) + o(h|| 2 ), 

therefore 

q k -Pk - (k\(v ■ T)p( V ■ T)\k) - Pk (k\( V ■ T) 2 \k) + o(\\ V \\ 2 ). 

From the previous expression and the fact that 77 goes to zero at the rate 7V -A1 / 2 one 
can expect that 

where c is a constant possibly depending on p. From the previous equation it follows 
that 

lim NE(q k - p k ) 2 = 0, (7) 

n— >oo 

if and only if \i > 1/2. Now, using JZJ together with the Cauchy-Schwartz inequality 

(E[JV(g fc - Pk ){qi - pi)]) 2 < nN(q k - Pk ) 2 ] E[N( qi - Pl ) 2 }, 
J^J follows. As pointed out before, the desired result Q is a consequence of ©. 



6. Rigorous argument 

6.1. Some intermediate results 

If p = 1/d, then any basis chosen for the second stage will give (qk — p k ) = 0, so in 
what follows it is assumed that p ^ 1/d, i.e., p has at least two different eigenvalues. 

The following intermediate result will be needed. Basically it states that if p and 
a are close to each other, then so will be their eigenvalues and eigenspaces. 
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Lemma 1. Let 

n 

P = ^PaRa, 

a=l 
d 

o- = 22s k \ip k )(ipk\, 

fe=l 

where p a =^ pb for a b, 2 < n < d is the number of different eigenvalues and IL a is 
a projector onto the eigenspace corresponding to eigenvalue p a , and let d a = trH a be 
the degeneracy of p a , also let 

A = minmin \p a — p&l > 0. 

a b=£a 

V 

then 
(i) Va, k 



d H s(p, cr) = A/tr(p - cr) 2 < 6 < 



1 + Vd' 



\Pa ~ Sfc| V WkWabh) < $i 

i.e., either p a is close to s^ or \ip^) is almost orthogonal to the eigenspace 
corresponding to p a . 

(ii) Va 3k such that \p a — Sk\ < S and Vfc 3a such that \p a — Sk\ < §, i.e., every 
eigenvalue of a is close to an eigenvalue of p and vice versa. Let M a = {k : 
\p a - Sk\ < 3} and m a = \M a \ > 0. Note that M a n M b = for a^b. 

(Hi) Let a^b, then if k £ Mb, then \p a — Sk\ > A — 5 and 

V / WW< 



a-<t 



i.e., if Sk is within a distance S of pb ^ p a , then \tpk) * s almost orthogonal to the 
eigenspace corresponding to p a . 

(iv) m a — d a , i.e., ford small enough, the number of eigenvalues of a within a distance 
S from p a is equal to the degeneracy of p a . 

(v) Vfc e M a , 

\p a - (i>k\p\i>k)\ < c(p)s 2 , 

where 

cM - 

The proof of this lemma is given in |Appendix A| 

Now the way in which the first rough estimation is done will be specified. For 
this part it is convenient to represent p and o in the following way 
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The initial measurement strategy (which will be called plain tomography) is to divide 
the initial number of copies Ni in d 2 — 1 groups of size A*o = JVj/ (d 2 — 1), and in group 
a € {1, . . . , d 2 — 1} perform the measurement 

± 2 

The probabilities are 

(a) 1 ± ®a 

py = —j— 

Let w Q+ be the number of times that outcome + was obtained, it is binomially 
distributed w a + ~ Bin(iVo, (1 + 9 a )/2). The estimator for 6 a is taken to be 

e a = 2 ^±-i. 

The following result holds: 

Lemma 2. 7//i > 1/2 then Ve > and Wi > 

lim f^' 1 Pr \VN\q k - Pk\ > el ) = 0. (8) 

The proof of this lemma is given in |Appcndix B| 

6. 2. Proof of the main result 
Theorem 3. If fi > 1/2 then ^ holds. 



Proof. Let X k ' = V N(qk —pk), clearly (X^ ') < N. All that needs to be proven is 
that 

lim E[X ( k N) xl N) ] = 0. 

We have that 

\E[xi N | < E[|xf < ^E[(xf')2]E[(^)2], (9) 

where in the second inequality the Cauchy-Schwartz inequality has been used. Now 
choose any e > 0, 

E[(4^) 2 ]=£zPr[(xf>) 2 =z] 

= £ xPrKxf^^xl+^xPrKX^)^.] 

0<x<e 2 x>e 2 



< e 2 Pr[(Xi N) ) 2 < e 2 ] + NPr[(X k * ) f > e 2 } 
<e 2 + . 

using now JHJ one gets that Ve > 0, 



<e 2 + NPr\\xi N) \ >el 



lim E[(xi N) ) 2 } < e 2 , 



which implies that it must be zero; this fact and @ imply (|HJ and therefore the desired 
result Q. □ 
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We have proven something about the limit of the MSE, but © is a bound to the 
variance of the limiting distribution. However, since the limit of the MSE cannot be 
smaller than the variance of the limiting distribution (which in this case can easily be 
proven to be Gaussian) it follows that our estimator achieves the bound ©. 

7. Estimation of bipartite pure state entanglement 

A bipartite entangled pure state \iPab) € H.a ® Kb can be written as (Schmidt's 
decomposition) 

d 

Wab) = ^2VPk \k) <8> \e k ), 
fe=i 

where {\k)} and {|efc)} are orthonormal basis of Ha and Hb which are both of 
dimension d. 

The entanglement of \i[>ab) can be calculated as the entropy of one of the reduced 
states, 

d 

E(\iPab)) = -tr(p A \og 2 PA) = -^Pfclog 2 p fc , 

k=l 

where pa = trg \'>Pab)('4 } ab\, i-c, entanglement is a function of the eigenvalues of 
the reduced density matrix. This means that entanglement can be estimated by 
performing measurements on pa only, in order to estimate its spectrum. The question 
is whether this procedure is optimal. A quick calculation of the QFI for the parameters 
Pk in the model given by \iPab) shows that indeed the entanglement of \iPab) can be 
optimally estimated by estimating the spectrum of pa using the procedure described 
above in this paper. 

The same result|| was obtained by |Acm et al., 2000] for d = 2 using other tools. 

8. Conclusions 

The estimation of the spectrum of a finite dimensional density matrix has been 
analyzed. The following LOCC procedure has been studied: 

(i) Perform the so called plain tomography on copies where p, > 1/2 and N is 
the total number of copies. From this one gets an initial estimate of the whole 
density matrix, call it a. Let . . . , \ipd) be a set of eigenvectors of a. 

(ii) Perform the measurement with elements Mk = |f/'fe)(V ; fc| 011 the remaining N — N 11 
copies and estimate pk as the number of times the outcome fc was obtained divided 
by N. 

It has been shown that the above procedure performs asymptotically as well as any 
measurement (including collective ones). This means that in the asymptotic regime 
there is no need to perform the more complicated collective measurements for the 
estimation of the spectrum of a density matrix (or pure bipartite entanglement) . 

| That entanglement can be optimally estimated by estimating the spectrum of the reduced density 
matrix. 
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Appendix A. Proof of lemma 111 

(i) The square of the distance between p and a can be written as 

d n 

d HS (p,°-) 2 = J2Y. ^ i (p - CT ) n « I o° - 

k=l a=l 
d n 

= ^]Tba-s fe ) 2 (^|n a ^ fe ) <s 2 . 

k=l a=l 

Since all terms are nonnegative, this implies that all of them are less than or equal 
to 8 and this implies point (jJJ. 

(ii) For point ijujl. only the first statement will be proven, the proof of the second is 
almost identical. Suppose that the opposite is true, i.e., that 3a such that Vfc 
\p a — Sk\ > S then 



k=l 6=1 



>^(p a ~ Sfc ) 2 (v^|n Q |Vfc) 



S2 j_„ TT \ JC2 



> 5 Z tr n a > 5 Z 
i.e., d_ys(p, cr) > 5 which is a contradiction. 

(iii) \p a - s k \ = \(p a - Pb) + (pb - Sfe)| > \p a - Pb\ - W - s k \ > A - 5, the second 
statement follows from the previous inequality and point (J. 

(iv) 

keM a ke.M a 

= trn o - (V'fclHalV'fc) 



>trn a - 



5 ^ 2 



> trlL, 



A-<5. 

2 

where point (fTTTf) has been used. Now, since d a = trll a , we get 

2 

-A - d , 



m a > d a - d 
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6 



A-S 



< 1, 



and since m a is an integer, we have that m a > d a . Using the fact that 
J2 a m a ^Ea^^ d , we S et tnat m a = d a . 
(v) Let a^6, and k e M a 



\Pa - PbW {lpk\H-b\4>k) = \(Pa- s k) + (Sk ~Vb)W (^fc|nb|V>fe) 
< [\Pa ~ Sk\ + \s k ~Pb\W (lpk\^b\tpk) 
< 



< 



6y/(ip k \nb\il>k) + \s k -pbW(^k\Tlb\lpk) 
< 25, 



where points 10) and (|nj have been used. Thus, we have that 

4<5 2 

(ip k n 6 Vfe) < 7 

(Pa - Pb) 

Now I turn to the quantity of interest, 



\Pa - (tpk\p\lpk)\ = 



Pa - y^Pb(^fc|n fe |yj fc ) 

b 

^2(Pa ~ Pb)(^k\ttb\lpl 
b 

< X \ Pa ~Pb\{i>k\^-bWk, 
b 



b^a 



\Pa~Pb\ 



< 



^8 2 = c{p) 6 2 .n 
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Now we enumerate the eigenvalues of p from 1 to d again, with some of them possibly 
equal. Points ijuj and (|rvjl of lemma ^ take care that for every eigenvalue of p, the 
right number of eigenvalues of a will satisfy point (0). From point (0) of lemma ^ we 
get that \qk — Pk\ > c (p)S 2 implies d(p, a) 2 > S 2 , we have 

Pr [\q k -Pk\> c{p)6 2 ] < Pr [d(p,a) 2 > S 2 ] 

2 -i 

Pr ' " 



Since 



a 17 a 



X><*-<U 2 ><5 2 



) 2 >5 2 
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7 a 17 a ) 



> 



it follows that 



]T (9* - 9 a f > S 2 



a=l 



< 1 - Pr 



Va, 



<P- 1' 



9 Q ) 2 < 



i - n pr 

Q = l 

d 2 -l 

i n pr 

a=l 

d 2 -l , 

i- n !- pr 



d 2 - 1 
(5 



x/d 2- ! 



-iVn 



< 



No 



a=l 



1 



Wo 



< 1 - 1 - 2 exp 



2 

-7V 



2 Vrf^T 

7V 



> 



2 \/rf2 



2(d 2 - 1)' 

In the last inequality we have used a form of the Chernoff bound^f. Thus, we finally 



have that 



Pr [k -Pk\> c(p)S 2 ] < 1 - ( 1 - 2 exp 



2{d? - 1) 



Nn 



now let c(p)S 2 = eN 1 / 2 and substitute N by its value, N^/^d 2 — 1), the result is 

-,x d 2 -i 



Pr 



N\q k ~Pk\ > e 



< 1 - 1 - 2 exp 



< 2{d 2 - 1) exp 



2c(p)(d 2 - l) 2 



2c(p)(d 2 - l) 2 

multiplying by N , taking > 1/2 and iV — * oo, we get the desired result (JSJ. 
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